Sentence extraction and rhetorical classification for flexible abstracts
نویسندگان
چکیده
Knowledge about he discourse-level structure of a scientific article isuseful for flexible and sub-domain independent automatic abstraction. We are interested in the automatic identification of content units ("argumentative entities") in the source text, such as GOAL OR PROBLEM STATEMENT, CONCLUSIONS and RESULTS. In this paper, we present an extension fKupiec et al.’s methodology fortrainable statistical sentence xtraction (1995). Our extension additionally classifies theextracted s ntenccs a cording to their argumentative status; because only low-level properties of the sentence are taken into account and no external knowledge sources other than meta-level linguistic ones are used, it achieves robustness and partial domain-independence.
منابع مشابه
Argumentative Classiication of Extracted Sentences as a Rst Step towards Exible Abstracting
Knowledge about the rhetorical structure of a text is useful for automatic abstraction. We are interested in the automatic extraction of rhetorical units from the source text, units such as Problem Statement, Conclusions and Results. We want to use such extracts to generate high-compression abstracts of scientiic articles. In this paper, we present an extension of Kupiec, Pedersen and Chen's (1...
متن کاملIdentifying Sections in Scientific Abstracts using Conditional Random Fields
OBJECTIVE: The prior knowledge about the rhetorical structure of scientific abstracts is useful for various text-mining tasks such as information extraction, information retrieval, and automatic summarization. This paper presents a novel approach to categorize sentences in scientific abstracts into four sections, objective, methods, results, and conclusions. METHOD: Formalizing the categorizati...
متن کاملSentence Extraction and Rhetorical Classiication for Exible Abstracts
Knowledge about the discourse-level structure of a sci-entiic article is useful for exible and sub-domain independent automatic abstraction. We are interested in the automatic identiication of content units (\argu-mentative entities") in the source text, such as Goal or Problem Statement, Conclusions and Results. In this paper, we present an extension of Ku-piec et al.'s methodology for trainab...
متن کاملRhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora
The relevance of automatically identifying rhetorical moves in scientific texts has been widely acknowledged in the literature. This study focuses on abstracts of standard research papers written in English and aims to tackle a fundamental limitation of current machine-learning classifiers: they are mono-labeled, that is, a sentence can only be assigned one single label. However, such approach ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002